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Abstract. In this paper, we establish a uniform error rate of a Bahadur representation 
for local polynomial estimators of quantile regression functions. The error rate is uniform 
over a range of quantiles, a range of evaluation points in the regressors, and over a wide 
class of probabilities for observed random variables. Most of the existing results on Bahadur 
representations for local polynomial quantile regression estimators apply to the fixed data 
generating process. In the context of testing monotonicity where the null hypothesis is of 
a complex composite hypothesis, it is particularly relevant to establish Bahadur expansions 
that hold uniformly over a large class of data generating processes. In addition, we estab¬ 
lish the same error rate for bootstrap local polynomial estimators which can be useful for 
various bootstrap inference. As an illustration, we apply to testing monotonicity of quantile 
regression and present Monte Carlo experiments based on this example. 
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1. Introduction 

In this paper, we establish a Bahadur representation of a local polynomial estimator of 
a nonparametric quantile regression function that is uniform over a range of quantiles, a 
range of evaluation points in the regressors, and a wide class of probabilities underlying the 
distributions of observed random variables. We also establish a Bahadur representation for 
a bootstrap estimator of a nonparametric quantile regression function. 

There are several existing results of Bahadur representation of the local polynomial quan¬ 
tile regression estimator in the literature. Chaudhuri (1991) is the classical result on a local 
polynomial quantile regression estimator with a uniform kernel. His result is pointwise in 
that the representation holds at one quantile, for a fixed point, and for a given data generat¬ 
ing process. For recent contributions that are closely related to this paper, see Kong, Linton, 
and Xia (2010), Guerre and Sabbah (2012), Kong, Linton, and Xia (2013), and Qu and Yoon 
(2015), among others. Kong, Linton, and Xia (2010) obtain a Bahadur representation for a 
local polynomial M-estimator, including the quantile regression estimator as a special case, 
for strongly mixing stationary processes. Their result holds uniformly for a range of eval¬ 
uation points in the regressors but at a fixed quantile for a given data generating process. 
Guerre and Sabbah (2012) obtain Bahadur representations that hold uniformly over a range 
of quantiles, a range of evaluation points in the regressors, and a range of bandwidths for 
independent and identically distributed (i.i.d.) data. However, their result is for a fixed data 
generating process. Kong, Linton, and Xia (2013) extend to the case when the dependent 
variable is randomly censored for i.i.d. data and obtain the representation that is uniform 
over the evaluation points. Qu and Yoon (2015) consider estimating the conditional quantile 
process nonparametrically for the i.i.d. data using local linear regression, with a focus on 
quantile monotonicity. They develop a Bahadur representation that is uniform in the quan¬ 
tiles but at a fixed evaluation point for a given data generating process. It seems that our 
work is the first that obtains a Bahadur representation that holds uniformly over the triple: 
the quantile, the evaluation point, and the underlying distribution. However, our result is 
for a fixed bandwidth, unlike Guerre and Sabbah (2012). 

The most distinctive feature of our result is that the Bahadur representation is uniform 
over a wide class of probabilities. Uniformity of asymptotic approximation in probabilities 
has long drawn interest in statistical decision theory and empirical process theory. Uniformity 
in asymptotic approximation is generally crucial for procuring finite sample stability of size 
or coverage probability in inference. See Andrews, Cheng, and Guggenberger (2011) for 
its emphasis and general tools for uniform asymptotic results. In the recent literature of 
econometrics, identifying the class of probabilities for which the uniformity holds, and their 
plausibility in practice, have received growing attention, along with increasing interests in 
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models based on inequality restrictions. See, for example, Andrews and Soares (2010) and 
Andrews and Shi (2013) among many others. 

To see the issue of uniformity, consider a simple testing problem: 


. . TT w dMed [A IA = x] tt ^ <9Med [Y\X = x] 

(1.1) Hq : \/x G A, ---- > 0 vs. Hi : 3x 6 A, ---- < 0, 

Ox ox 

where Med [Y\X = x\ is the conditional median of Y given X = x and A is a region of inter¬ 
est. Then one may develop a nonparametric test statistic using the local polynomial quantile 
regression estimator (e.g. the L p statistic as in Section [3]). The behavior of this nonparamet¬ 
ric test statistic depends crucially on the contact set 5 := (i 6 A : <9Med [Y\X — x]/dx — 
0}. To emphasize the issue of uniformity, consider a sequence of data generating processes 
indexed by n. For example, we take the sequence of the true conditional median functions to 
be Med n [Y|A = x\ = x 3 /n on A = [—1,1], Then for each n, the corresponding contact set is 
a singleton set, that is B n = {0}; however, Med n [Y\X — x\ — x 3 jn converges to 0 uniformly 
in x G A as n —>■ oo. In other words, as n gets large, the true function looks flat on A, 
but the population contact set is always the singleton set at zero for each n. This suggests 
that the pointwise asymptotic theory may not be adequate for finite sample approximation. 
Therefore, it is important to develop uniform asymptotics for the local polynomial quantile 
regression estimator by establishing the Bahadur representation that is uniform over a large 
class of probabilities. 

We illustrate the usefulness of our Bahadur representation by applying it to testing mono¬ 
tonicity of quantile regression that includes ( 1.1| ) as a special case. Our proposed test uses 
the framework of Lee, Song, and Whang (2015, LSW hereafter). They provide a general 
method of testing inequality restrictions for nonparametric functions, and make use of this 
paper’s result in establishing sufficient conditions for one of their results. 

The remainder of the paper is as follows. Section [2] presents the main results of the paper, 
Section [3] gives an application of our main results in the context of testing monotonicity, 
Section [4] concludes, and Section [5] gives all the proofs. 


2. Uniform Asymptotics 

This section provides uniform Bahadur representations for local polynomial quantile re¬ 
gression estimators and considers their bootstrap version as well. 

2.1. Uniform Bahadur Representation for Local Polynomial Quantile Regression 
Estimators. In this subsection, we present a Bahadur representation of a local polynomial 
quantile regression estimator that can be useful for a variety of purposes. 

Let (B T ,X T ,L) t , with B = (Bi, ...,Bi) t e R^, and A" e R d , be a random vector such 
that the joint distribution of (B T , A" T ) T is absolutely continuous with respect to Lebesgue 
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measure and L is a discrete random variable taking values from = { 1 , 2 , L}. For each 
x £ and k £ N L , we assume that the conditional distribution of B L given (X, L) = (. x , k) 
is the same across l = 1, k. It is unconventional to consider a vector B, but it is useful to 
do so here to cover the case where we observe multiple outcomes from the same conditional 
distribution. 

Let q k (r\x) denote the r-th quantile of Bi conditional on X = x and L — k, where 
r G (0,1). That is, P{Bi < q k (r\x)\X = x, L = k} = r for all x in the support of X and all 
k £ {1, We write 

Bi = q k {r\X ) + e T j k , r £ ( 0 , 1 ), for all k £ { 1 , L}, 

where e T) i k is a continuous random variable such that the r-th conditional quantile of e T) i k 
given A" and L = k is equal to zero. 

Suppose that we are given a random sample {(Bj, Xj, Lj ) T }” =1 of (B T , X T , L) T Q Assume 
that q k (r |x) is (r + l)-times continuously differentiable with respect to x, where r > 1. We 
use a local polynomial estimator q k (r\x), similar to Chaudhuri (1991). For u = (u\,... ,Ud ), 
a d-dimensional vector of nonnegative integers, let [w] = u± + ■ ■ ■ + Ud. Let A r be the set of 
all d -dimensional vectors u such that [it] < r, and let \A r \ denote the number of elements 
in A r . For z = (zi ,..., Zd) T £ R cZ with u = (ui ,..., Ud) T £ A r , let z u = nl=i z rrT■ Now 
dehne c(z) = ( z u ) u& A r , f° r z G R d - Note that c(z) is a vector of dimension \A r \. For 
u = (mi, ...,Ud) T £ A r , and r + 1 times differentiable map / on R d , we dehne the following 
derivative: 

where [u] = U\ + • • • + Ud- Then we dehne 7 Ti a,(x) = (' y T ,k,u(x)) ueAr , where 

7 r ,k,u(x) = —- :D u q k (r\x). 

ud ■ • • u d \ 

We construct an estimator ^ r , k (x) as follows. First, we dehne for each 7 G Rl Ar l, 

n Li 

i=1 t= 1 

Then we construct 
( 2 . 1 ) 7 r,ki x ) = argmin 76R | A r \S ntXjTtk ('j), 

where l T [u) = u[r — l{w < 0}] for any u £ R, K k (t) = K(t/h)/h d , K is a d-variate kernel 
function, and h is a bandwidth that goes to zero as n —» 00 . 

1 In fact, the estimator allows that we do not observe the whole vector Bi, but observe only Bu,B k i 
whenever Li = k. 


S n ,x,r,k (7) = l { Li = 
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In order to reduce the redundancy of the statements, let us introduce the following defi¬ 
nitions. 


Definition 1. Let G be a set of functions g v : R m —» R s indexed by a set V, and let S C R m 
be a given set and for £ > 0, let Sfie) be an ^-enlargement of S v = {x G S : ( x,v ) G S' x V}, 
i.e., Sfie) = {x + a : x G S and a G [— e,e] m }. Then we define the following conditions for Q: 

(a) B(S', e): g v is bounded on S v (e ) uniformly over v G V. 

(b) BZ(S', e): g v is bounded away from zero on S v (e) uniformly over v G V. 

(c) BD(S', £,r): Q satisfies B(S', e) and g v is r times continuously differentiable on S v (e ) with 
derivatives bounded on S v (e) uniformly over v G V. 

(d) BZD(S', e,r): Q satisfies BZ (S,e) and g v is r times continuously differentiable on S' 1 ,(e) 
with derivatives bounded on S v (e) uniformly over v G V. 

(e) LC: g v is Lipschitz continuous with Lipschitz coefficient bounded uniformly over v G V. 


Let V denote the collection of the potential joint distributions of (B T , X T , L) T and define 
V = T xP, and for each fc G N^, 


( 2 . 2 ) 

Gfik) = 

{«.-(t|,) : (t,P) e v}, 



G f (k) = 

IM-ITU-PJeV}, 



G L (k) = 

{P {L. — k\Xj — ■} : P G V} , 

and 


Qf = 

{/(•): P 6 P}, 


where f T ,k(0\x) 

being the conditional density of Bn — quij\Xf) 

given Xi 

Also, define 




(2,3) 

G f ,2(k) = {/.,*(•!• 

) : P ev} and Gfik) = { 7 . )fc (-} 

:PeV} 


In other words, Qf^(k) is the class of conditional densities f T ,k(-\x) indexed by r, x, and 
probabilities P, and Q^k) is the class of functions 7 r,fc(') indexed by r and probabilities P. 
We make the following assumptions. 


Assumption QR1. (%) Qf satisfies BD(S,e, 1). 

(ii) For each k G Nl, Gf{k ) and Gh{k) satisfy both BD(S,e, 1) and BZD(S,e, 1). 

(in) For each k G Ni, G q (k) satisfies BD(S,e,r + 1) for some r > 1. 

(iv) For each k G Nl, Gf, 2 (k) and Gfik) satisfy LC. 

Assumptions QRl(i) and (iii) are standard assumptions used in the local polynomial ap¬ 
proach where one approximates qk(\x) by a linear combination of its derivatives through 
Taylor expansion, except only that the approximation here is required to behave well uni¬ 
formly over PgP. Assumption QRl(ii) is made to prevent the degeneracy of the asymptotic 
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linear representation of 7 T ,k( x ) — 7 T ,k( x ) that is uniform over x G d> r (e), r G T and over 
P G V. Assumption QRl(iv) requires that the conditional density function of Bn — q k {r\Xi) 
given Xi = x and L, = k and 7 r ,fc(-) behave smoothly as we perturb r locally. This re¬ 
quirement is used to control the size of the function spaces indexed by r, so that when the 
stochastic convergence of random sequences holds, it is ensured to hold uniformly in r. 

Let 11 • 11 denote the Euclidean norm throughout the paper. Assumption QR2 lists condi¬ 
tions for the kernel function and the bandwidth. 

Assumption QR2. (i) K is compact-supported, bounded, and Lipschitz continuous on the 
interior of its support, f K(u)du = 1, and f K ( u ) ||u|| J du > 0. 

(ii) As n — y 00 , n~ x l 2 h~ d l 2 log n + \/nh d h r+1 / \/log n —> 0, with r in Assumption QRl(iii). 


Assumption QR2 gives conditions for the kernel and the bandwidth. The condition for 
the bandwidth is satisfied if we take h = Cn~ s for some constant C with s > 0 satisfying 
that l/(d + 2 (r + 1)) < s < 1/d. 

For any sequence of real numbers b n > 0, and any sequence of random vectors Z n , we say 
that Z n /b n —>p 0, V- uniformly, or Z n = o P (b n ), P-uniformly, if for any a > 0, 

lim sup sup P {11 Z n \ | > ab n } = 0. 

n—>00 Pg'P 

Similarly, we say that Z n = Op(b n ), V- uniformly, if for any a > 0, there exists M > 0 such 
that 

lim sup sup P {11 Z n \ | > Mb n } < a. 

n—>00 PEP 

Below, we establish a uniform Bahardur representation of Vnh d H(^ T ,k{x) — r ) T ^{x)), where 
H = diag((hJ u l) u&A r ) is the \A r \ x \A r \ diagonal matrix. First we introduce some notation. 
We define 

A x,r,ik,i = Bn 7 Tjki.x^c^Xi x), 

Ch,x,i = C (( Xi - x)/h) , and K hjXti = K (( Xi -x)/h). 

Let 

M n)T ,fe(x) = k J P{Lj = k\Xi = x + th}f Tj *;( 0 |x + th)f{x + th)K(t)c(t)c T (t)dt. 

Theorem 1 . Suppose that Assumptions QR1-QR2 hold. Then, for each k G N L , 
sup Vnh d H(% k (x) - Tr,k{x)) - M~^ k (x) (■ ip n ,x,T,k ~ E ip n , x , T ,k) 

t£T,x£S t (e) 
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where, with l T (x) = t — l{x < 0}, 

n Li 

^ ' 1 { h t k{ ^ y lr (A x,r,lk,i ) C-h,x,iL^h,x,i- 

i= 1 1=1 

The proof in this paper uses the convexity arguments of Pollard (1991) (see Kato (2009) for 
a related recent result) and, similarly as in Guerre and Sabbah (2012), employs the maximal 
inequality of Massart (2007, Theorem 6 . 8 ). The theoretical innovation of Theorem 1 is that 
we have obtained an approximation that is uniform in ( x , r) as well as in P. See Remark 1 
below for a detailed comparison. 

Remark 1. The main difference between this paper and Guerre and Sabbah (2012) is that 
their result pays attention to uniformity in h over some range, while our result pays atten¬ 
tion to uniformity in P. Also it is interesting to note that the error rate here is a slight 
improvement over theirs. When d — 1, the rate here is Op[\/\ogn/ (n 1 / 4 /?. 1 / 4 )] while the 
rate in Theorem 2 of Guerre and Sabbah (2012) is Op [log 3 / 4 nj (n 1 / 4 /! 1 / 4 )]. The difference 
is due to our use of an improved inequality which leads to a tighter bound in the maximal 
inequality in deriving the uniform error rate. For details, see the remark after the proof of 
Theorem 1 in the appendix. 



The summands in the asymptotic linear representation form in Theorem 1 depend on the 
sample size and are not centered conditional on Xj. While this form can be useful in some 
contexts, the form is less illuminating. We provide an asymptotic linear representation that 
ensures this conditional centering given X t . 


Corollary 1. Suppose that Assumptions QR1-QR2 hold. Then, for each k G PR, 


sup 

t£T,x£S t (e) 


Vnf?H(% k (x) - lr,k{x)) - M n ^ k {x)^ ntXiTtk 


f log 1/2 n \ 

P l rPW/ 4 J 


V-uniformly. 


where 


'fn,x,r,k — 


\Jnh d 


n 

£ 


L 


1 {Li k{ / ^ l T ( pr,lk,i ) ^h,x,i^h,x,i 


i=i 


Note that the quantity M nTk (x) in the representation can be replaced by 
M T ,k(x ) = k f P{Li = k\Xi = x}f Ttk (0\x)f(x)K(t)c(t)c T (t)dt, 
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if we modify the conditions on kernels and the smoothness conditions for the nonparametric 
function P{L, : = k\Xi = x}f T> k(0\x)f (x). As this modification can be done in a standard 
manner, we do not pursue details. 

2.2. Bootstrap Uniform Bahadur Representation for Local Polynomial Quantile 
Regression Estimator. Let us consider the bootstrap version of the Bahadur representa¬ 
tion in Theorem 1. Suppose that {(Fj* T , A * T )}” =1 is a bootstrap sample drawn with replace¬ 
ment from the empirical distribution of {(Y) T , Xj )" =1 }. Throughout the paper, the bootstrap 
distribution P* is viewed as the distribution of (Y*, Af*)" =1 , conditional on (Y^,Afj)” =1 , and 
let E* be expectation with respect to P*. 

We define the notion of uniformity in the convergence of distributions under P*. For 
any sequence of real numbers b n > 0, and any sequence of random vectors Z*, we say that 
Z*/b n -Ap* 0, P-uniformly, or Z* = op*(b n ), P-uniformly, if for any a > 0, 

limsup sup P {P* {||Z*|| > ab n } > a] = 0. 

n—>oo PdL'P 

Similarly, we say that Z* = Op*(b n ), P-uniformly, if for any a > 0, there exists M > 0 such 
that 

limsup sup P {P* {||Z*|| > Mb n } > a} < a. 

n—>oo PE.V 

For z = (x, t ) G Z, define A*_ T) j fc)i = Y{- ~^ Tk (x)c{X*-x), and let c* h x i and K* hx i be c h<Xti 
and Kh,x,i except that X b is replaced by X*. Then the following theorem gives the bootstrap 
version of Theorem 1. 


Theorem 2. Suppose that Assumptions QR1-QR2 hold. Then for each k G Nj, 


sup 

(x,t)£Xi xT 


VnJ?H{ % k (x) - 7r, k (x)) - M n ^ k (x) «, x , T)fc - E*fj* n ^ k ) 


= O r 


f log 1//2 n \ 
\n 1 / A h d P ) 


, V-uniformly. 


where ip* n ^ k = E?=i H L i = k } Ef=i ~ l r (■K,r,ik,i ) c h,x,i K h,x,i- 


Theorem 2 is obtained by using Le Cam’s Poissonization Lemma (see Gine (1997, Propo¬ 
sition 2.5)) and following the proof of Theorem 1. The bootstrap version of Corollary 1 
follows immediately from Theorem 2. 


3. Testing Monotonicity of Quantile Regression 

This section considers testing monotonicity of quantile regression. We first state the 
testing problem formally, give the form of test statistic, verify regularity conditions, and 
present results of simple Monte Carlo experiments. 
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3.1. Testing Problem. Let q(r\x) denote the r-th quantile of Y conditional on A" = x, 
where r G (0,1) and X is a scalar random variable. In this subsection, we consider testing 
monotonicity of quantile regression. Define g T {x) = dq(r\x)/dx. The null hypothesis and 
the alternative hypothesis are as follows: 


(3.1) 


:g r (x) < 0 for all (r, x) G T x X against 
Hi : g T (x) > 0 for some (r,x) G T x X, 


where X is contained in the support of X and T C (0,1). The null hypothesis states that 
the quantile functions are increasing on X for all r € T, and the alternative hypothesis is 


the negation of the hypothesis. If T is a singleton, then testing (3.1) amounts to testing 
monotonicity of quantile regression at a fixed quantile. 

Suppose that q{r\x) is continuously differentiable on X for each r G T. Then one natural 
approach is to test the sign restriction of the derivative of q{r\x). In other words, we develop 
a test of inequality restrictions using the local polynomial estimator of dq(r\x)/dx. 

One may consider various other forms of monotonicity tests for quantile regression. For 
example, one might be interested in monotonicity of an interquartile regression function. 
More specifically, let T\ < r 2 be chosen from (0,1) and write A g Tl ,r 2 { x ) = g T2 { x ) ~ 9n( x )- 
Then the null hypothesis and the alternative hypothesis of monotonicity of the interquartile 
regression function are as follows: 


(3.2) ifo ,a : A g TliT2 {x) A o for all x G X against 

Hi t a : A g T1 ,T 2 {x) > 0 for some x G X. 

The null hypothesis states that the interquartile regression function q(r 2 \x) — q{ji\x) is 
increasing on X. This type of monotonicity can be used to investigate whether the income 
inequality (in terms of interquartile comparison) becomes severe as certain demographic 
variable X such as age increases. 


3.2. Test Statistic. Suppose that we are given a random sample {(Y),Aj)}™ =1 of (Y, A). 
First, we estimate g T {x) by local polynomial estimation to obtain, say, g T {x) = e^ 7 T ((c), 
where e 2 is a column vector whose second entry is one and the rest zero, and 

n 

(3.3) %(x) = argmin 7gR r+i ^ l T (- 7 T c(A* - x)) K h (X t - x). 

1=1 

This paper applies the general approach of LSW, and proposes a new nonparametric 
test of mononoticity hypotheses involving quantile regression functions. First, testing the 
monotonicity of q(r |-) is tantamount to testing nonnegativity of g T on a domain of interest. 
Define A p (a) = (max {a, 0}) p for any real numbers a > 0 and p > 1. We consider two test 
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statistics corresponding to the two testing problems discussed in Section 3.1 for 1 < p < oo, 


(3.4) T n>2 = / A p (g T (x))w T (x)d(x,T), and 

JXxT 

Tn= A p (Ag TltVt (x))w(x)dx, ti,t 2 e (0,1), 

J x 

where A g T1 , T2 {x) = g T2 {x\) — g Tl (x i), uy(-) and w(-) are nonnegative weight functions. The 
test statistic T nj2 is used to test Lfo against H\ and the test statistic T n , A to test iL 0 ,A against 
Hi ,a ■ 

Let {(Y*,X*)} be a bootstrap sample obtained from resampling from {(Yj, Xj )}” =1 with 
replacement. Then we define 


(3.5) 7 * 0*0 = argmin 7 eR r+i ^ l r ( Y * ~ 1 T c( x * ~ x)) K h (X* - x) 

i =1 


and take g*(x) = ej^*{x), similarly as before. We construct the “recentered” bootstrap test 
statistics: 

(3.6) T* 2 = [ A p (g*(x) — g T (x)) w T (x)d(x,r), and 

JXxT 

T* a = [ A p (Ag* uT2 (x)-Ag TuT2 (x))w(x)dx, 

J x 

where Ag* (x) = g* 2 (x ) — g* x {x). We can now take the bootstrap critical values c* 2ol and 
c* A to be the (1 — a) quantiles from the bootstrap distributions of T* 2 and T* A . Then we 
define 

c* 2 , a , p = m ax{c; a , h 1/2 i 7 + d* 2 } and c A a ri = max{c^ a , h 1/2 g + a^}, 

where a 2 = E*T r * 2 and a* A = E *T* A . The (1 — a)-level bootstrap tests for the two hypotheses 
are defined as 


(3.7) 


Reject H 0 if and only if T n ,2 > c 2 
Reject H oa if and only if T n , A > c* A ar) . 

3.3. Primitive Conditions. We present primitive conditions for the asymptotic validity of 
the proposed monotonicity tests. Let V denote the collection of the potential joint distribu¬ 
tions of (Y, X) T and define V = T x V as before. We also define V = T 2 x V and define Qj 
as in (2.2). Similarly as in (2.2), we introduce the following definitions: 


Q, = {s,(.):(r,P)EV), 

Sa 9 = {Aj ri ,^(.) : (ri,r 2 , P) 6 V}, and 
Sq,i = {/ r (.|.):(r,P)6V}, 
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where / r (0|x) being the conditional density of Y t — q{r\Xi) given X t = x. Also, define 
Gf .2 = {/.(•!•) :PeV} and Q, = { 7 .(-) : P e V} . 

We make the following assumptions. 

Assumption MON 1. (i) Gf satisfies BD(S,e, 1). 

(ii) Gqj satisfies both BD(S,e,l) arid BZD(S,e, 1). 

(in) Gy satisfies BD(S,e,r + 1) for some r > 3/2. 

(iv) Gf .2 and G 7 satisfy LC. 

Assumption MON 2. (i) K is nonnegative and satisfies Assumption QR2(i). 

(ii) n - 1 / 2 h - {( 3+i/ )/ 2 } +1 + y/nh r+2 /\/log n —» 0, as n —> oo, for some small v > 0, with r in 
Assumption MON 1 (Hi). 

Assumption MON1 introduces a set of regularity conditions for various function spaces. 
Conditions (i)-(iii) require smoothness conditions. In particular, Condition (iii) is used to 
control the bias of the nonparametric quantile regression derivative estimator. Condition 
(iv) is analogous to Assumption QRl(iv) and used to control the size of the function space 
properly. Assumption MON2 introduces conditions for the kernel and bandwidth. The 
condition in Assumption MON2(ii) requires a bandwidth condition that is stronger than 
that in Assumption QR2(ii). 

Assumptions IQM1-IQM2 below are used for testing iho,A- 

Assumption IQM1. (%) Assumptions MONl(i),(ii) and (iv) hold. 

(ii) GA g satisfies BD(S,e,r + 1) for some r > 3/2. 

Assumption IQM 2. The kernel function K and the bandwidth h satisfy Assumption 
MON2. 

The following result establishes the uniform validity of the bootstrap test. Let Vo C 
V denote the set of potential distributions of the observed random vector under the null 
hypothesis. 

Theorem 3. (i) Suppose that Assumptions MON1-MON2 hold. Then, 

limsup sup P{T n , 2 > cla, v } < oi. 

n—xx> P&V o 

(ii) Suppose that Assumptions IQM1-IQM2 hold. Then, 

limsup sup P {T nA > c* A a . n } < a. 

n—>oc PePg 
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Using the general framework of LSW, it is possible to establish the consistency and local 
power of the test. Furthermore, it is also feasible to obtain a more powerful (but still 
asymptotically uniformly valid) test by estimating a contact set at the expense of requiring 
an additional tuning parameter (see LSW for details). 


3.4. Monte Carlo Experiments. In this subsection, we present results of some Monte 
Carlo experiments that illustrate the finite-sample performance of one of the proposed tests. 
Specifically, we consider the following null and alternative hypotheses: 


H 0 : \/x G X, g{x) > 0 vs. Hi : 3x G X, g{x) < 0, 


where g(x) = ^Median \Y\X —x]/dx. In the experiments, X is generated independently 
from Unif[0,1] and U follows the distribution of X 4 x N(0,0.1). 

To check the size of the test, we generated Y = U, which we call the null model. Note 
that the null model corresponds to the least favorable case. To see the power of the test, we 
considered the following alternative models: Y = m,j(X) + U (j — 1,2, 3,4,5), where 


m,i(x) = x(l — x), 
m 2 ( x ) = —O.lx, 

7773(2;) = — 0.1 exp(—50(rr — 0.5) 2 ), 

7774 ( 2 ;) = x + 0.6 exp(—10m 2 ), 

7775(2;) = [10(x — 0.5) 3 — 2exp(—10(x — 0.5) 2 )]l(x < 0.5) 

+ [0.1(x - 0.5) - 2exp(-10(x - 0.5) 2 )]l(x > 0.5). 


In all experiments, X = [0.05,0.95]. Figures [I] and [2] show the true functions and corre¬ 
sponding simulated data. 

The experiments use sample size of n = 200 and the nominal level of a = 0.10,0.05, and 
0.01. We performed 1, 000 Monte Carlo replications for the null model and 200 replications 
for the alternative models. For each replication, we generated 200 bootstrap resamples. We 
used the local linear quantile regression estimator with the uniform kernel on [— 1 / 2 , 1 / 2 ] for 
K(-). Furthermore, for the test statistic, we used p = 2 (one-sided L 2 norm) and uniform 
weight function w{x) = 1 and h G {0.9,1,1.1}. 

For the null model, the bootstrap approximation is quite good, especially with h — 1. The 
test shows good power for alternative models 1-3 and 5. For the alternative model 4, the 
power is sensitive with respect to the choice of the bandwidth. Overall, the finite sample 
behavior of the proposed test is satisfactory. 
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4. Conclusions 

In this paper, we have established a uniform error rate of a Bahadur representation for 
the local polynomial quantile regression estimator. The error rate is uniform over a range of 
quantiles, a range of evaluation points in the regressors, and over a wide class of probabilities 
for observed random variables. We have illustrated the use of our Bahadur representation 
in the context of testing monotonicity. In addition, we have established the same error rate 
for the bootstrap local polynomial quantile regression estimator, which can be useful for 
bootstrap inference. 


5. Proofs 


We also define for a, b G R|A r |, 

n Li 

C n,x,T,kifli fy = X ^ 1 {Li k) ^ ^ 


It T b'j Cfi' X ,i/’s/flh ^ 


i =1 


1=1 


and 


W y^x,r,lk,i & Cfi,x,i/ 

C n,x,T,k( a ib) = C n,x,T,k{ a ib) ~ b lp n ,x,T,k- 


K, 


h,x,ii 


Lemma QR1. Suppose that Assumptions QR1-QR2 hold. Let {hi n }^ =1 and {<52n}^Li be 
positive sequences such that 8 ln = M A/log n for some M > 0 and 62,1 < 5\ n from some large 
n on. Then for each k G Nl, the following holds uniformly over P G V: 

(i) 


(n) 

(Hi) 



sup sup I Cn,x, T,k(a, b ) - E[C n,x,T,k( a ’ 6 )] I 

a,b:\\a\\<Si n ,\\b\\<S 2 n t£T,x£S t (e) 

( h 2 n \/logw \ 

\ n x Ph d P ) 


E 


SUP \\^n,x, T ,k 
t£T,x£S t (s) 


O ^ \J\og n + h r+1 j = O ^ \/log nj . 


sup sup 

a,6:||a||<<5in,||b||<52n rST,xeS T (e) 


E[C n,x,r,k 


(« a,b )] 


(J 2 Jjn_\ 

\ni/ 2 h d / 2 ) ■ 


b T M n , r ,k{x){b + 2 a) 
2 
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Proof of Lemma QR1. (i) Define 

(5.1) S T ,k(x 1 ; x) = q k (r\xi) - 'f T ,k( x i) Tc ( x i ~ x )i 

and 


(5.2) S n , T ,k{xi’,x) =S Tjk (x i;x)l{|xi - x| < h}, 
where the dependence on P is through q k {r\xf) and 'y Tjk (xi). We also let 

(5.3) S n , T ,k(xi) = SU P sup \5 n:T , k (xi,x)\. 

a:S5 T (e) P&V 


It is not hard to see that 


(5-4) SUp reT) Xl £S T (£)\3n,T,k(Xi) | = 0(h r+1 ), 

because q k {r lay) — 7 r ,fc(^i) T c(a:i — x) is a residual from the Taylor expansion of q k (r\xi) and 
X is bounded, and the derivatives from the Taylor expansion are bounded uniformly over 
PeV. 

Let fif k x {t\x r ) be the conditional density of A x>Tt i k ,i given Xf = x'. For all x' G R d such that 
\x — x'\ < h, 

(5.5) fr^xitW) = ( d/dt)P{B H - q k (r\Xi) < t - Sn^X^ x)\Xi = x'} 

fr,k (t &n,T,k 5 ^ ) | % ) • 


Since f Tjk (-\x') is bounded uniformly over x' G S T (e) and over r G T (Assumption QRl(ii)), 
we conclude that for some C > 0 that does not depend on PeV, 

(5.6) sup sup sup f^ k x (t\x ) < C. 

tET x',xES t (e) R 


We will use the results in (5.4) and (5.6) later. 


Following the identity in Knight (1998, see the proof of Theorem 1), we write 

l r ( x - y) - l T (x) = -y ■ l T (x) + n(x, y), 
where y(x, y) =y < ys} — l{x < 0}}ds and 

l T (x) = r — l{x < 0}. 

Hence (n,x,T,k( a i b ) - E[C n,x,T,k( a , b )] is equal to 


n 

^ ^ 5) E [Gn,x,r,k($ii ^5 ^)]} 5 

2—1 
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where S* = (Bj, Xj, L { ) r , B, = (B u , ...,B Li y, and 


(5.7) 


Q'lb') — / C/n,x,T,k($ii ^5 CL)ds 


and g n ,x,T,k(Si ; A b, a) is defined to be 


HLi = k}J2 


1 V ■/ Vnh d (s6) v / VH h " J \ If /■ I' ^ , i\ ) , 


/=1 


1 ] A x,r,lk,i O' Ch,x,i/ \/ Tlfl d ^ 0 


Vnh d 


Let £/ n — b') • (tt, 6, 3?) G [ ^ln] X [ ^2n? ^2n] X <S r (<s),T G , 

£?ln = { / ^r,ln (*5 *^) • (&> 3?) G ^ln] X £ 7~} 

<?2n = {& T A T ,2n(S^) • (M) ^ [~^2n, ^n]^ 1 X <S r (V),T G T} and 

^/3n = {A rj 3 7l (*, x) . X G <S r (<s),T G > 
where A r>ln (5<; a,x) = (A XiTm - a T A r>2n (5 i ;x))^ =1 , 


Ar,2n(^5 — Ch,x,il^Tih and A t? 372 ,(*S^, x) — K] 


h,x,i • 


First, we compute the entropy bound for C/ n . We focus on £/ ln first. By Assumption 
QRl(iv), there exists C > 0 that does not depend on P 6 P, such that for any r G T, any 
(a,x) and (a',x') in [— Si n , 5i n ] r+1 x S T (e), and any t,t' G T, 

\K,in(Si',a,x) - X T ^i n (Si]a',x')\ < Cn s {\\a - a'\\ + |r - r'| + ||z—x'||}, 

for some s > 0. Observe that for any s' > 0, 

N(e', [S ln ,8 ln ] r+1 xS r (e),|| • ||) < (6 ln /e') r+2 , 

because S T (s) is bounded in the Euclidean space uniformly in r G 77 Hence there are 
C, s' > 0 such that for all s' G (0,1], 

log N(e', Gin, 11 • I loo) < -Clog (o s'/S ln )n - s ') , 

where || • ||oo denotes the usual supremum norm. Applying similar arguments to Q 2n and Q$ n , 
we conclude that 


(5.8) log N(e', g mn , || • | |oo) < C - C log {s' /n), m = 1 , 2 , 3, 

for some C > 0. 

Define for x G R, 5 > 0, 

l£(x) = (1 — mxn.{x/8, 1}) 1{0 < x} + l{x < 0} and 
l%(x) = (1 — min{(x/5) + 1,1}) 1{0 < x + 5} + l{x + 5 < 0}. 
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We also define for x,y,z E R, 


y(x,y,z) 

= zy 

$i x ,y,z) 

= zy 

fj>s( x ,y> z ) 

= zy 


i U, 


Then observe that 
(5.9) 



v>s( x >y> z ) 

< 

v{x,y,z) 

< ${x,y,z) 


\${ x ,y,z] 

) -fJ.(x,y,z)\ 

< 

\zy\l{\ x \ 

<6} 


\vs(x,y,z', 

) -fi(x,y,z)\ 

< 

\zy\l{\ x \ 

<5} 


\${ x ,y,z) - 

,y',z')\ 

< 

C{\y - y 

'I + I ---1 + 

\x — x'\/8}, and 

\ys(x,y,z) - 

y L s^\y\z')\ 

< 

C{\y-y 

'I + I ---1 + 

\ x - x'\/8}, 


for any y , y', x, x', z, z' E R. Define 

G n fi = {/B (,9l (*^i) > 9% ('Si) 1 93^i}') ■ 9m £ Gmni m lj 2, 3 j- , and 

^n ,5 = {^s( 9 i(S i ),g 2 (S i ) : g 3 ( y S i )) : g m E G mn , rn = 1,2,3} . 


From (|5.9) and (5.8), we hnd that there exists C > 0 such that for each 5 > 0 and e > 0, 

log A r [] (C'e, g% s , L p (P)) < C -C\og(e8/n) and 
log N {] (Ce,g^,L p (P)) < C — C\og(e8/n). 


(5.10) 


Fix e > 0, set 8 = e, and take brackets [rql, gfy ],..., [^,5^] and [g\ e [, g^jj ],..., \g^’ L , g^} 
such that 


,(e) „( £ ) 


:(e) ~(e) 


-(e) ~(e) 


(5.11) 


E 

E 


(ls?i( 5 'i) *^!l(^)| 2 ) < £ 2 and 


and for any g E G% and g E G r nl there exists s E {1, ...,1V} such that g^l < g < g ( fjj and 
ful <9< g ( s%- Without loss of generality, we assume that e G„ and g'^Lffsli e ■ 

By the first inequality in (5.9), we hnd that the brackets [g^ L , gflj\-, k — 1, AT, cover G n - 


Hence by putting 5 — £ in (5.10) and redefining constants, we conclude that for some C > 0 
(5.12) 

for all £ > 0. 


log N {] (C£,Gn,L p (P)) <C-Clog(£/n), 
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Now, observe that 
(5.13) 


sup 

b-.\\b\\<52n,T£T ,x&S T (e) 


b Ch,x,i-k-h,x,i 


\J nh d 


< 


c\\K\\oj2n 

\J nh d 


where c > 0 is the diameter of the compact support of K. 
For any g G Q n /L and any m > 1, we bound 

b Ch,x,i^h,x,i 


(5.14) 


\g(s,)r < 


\Jnh d 


Also, for any g G G n /L , we use (5.7) and bound E [\g(Si)\ 2 \Xi, C = k] by 




\Jnh d 


P 


b Ch,x,iKh, 3 


\Jnh d 


— A x ,T,lk,i 


® Ch,x,i^h,i 

Vnh d 


< 


b Ch tX} iKh,x,i 


\/nh d 


|Aj, Li k ? , 


where a G [—<5i n , <5i n ] r+1 and b G [— b 2n , ^ 2 n] r+1 by the dehnition of Q n . Using (5.13), we 
bound the last expression by 


Ci 




2 n 


(■ nh d ) 3 / 2 


PGP 


• sup P < max \X is — x\ < h/2 > < C 2 


s=l,...,d 


SLh d 


(nh d ) 3 / 2 ' 


for some constants Ci,C 2 > 0 . Therefore, by (5.13), for some constants Ci,C 2 > 0 , it is 
satisfied that for any m > 2 , 

m —2 


sup EOsOSOH <Ci 

PGP 


J2n 


=V 


, .supEOgfSj)! 2 ] <CX-h 2 n , 
\/nh a J pgP 


where 

(5.15) 


br, = ~ 2ri and = 


?3/2 


\Jnh d 


_ u 2n 

n = nv*h d /*' 


By (5.9), (5.11), and (5.13), and the definition of b n and ,s n in (5.15), there exist constants 
Ci, C 2 > 0 such that for all m > 2, 

E (l g l S(S t ) - sStSOr) = E (l gl%(S { ) - a$(S,)Mg$(S,) - g« (SOI 2 

< C.-5r 2 -E(|gS(Si)-9S(S.)| 2 ) 

< 2 C 1 .C" 2 -E(|gS(S,)-g<:i(S i )| 2 ) 
+2C 1 .C" 2 -E(|g<)i(S i )-g2(5 i )| 2 ) 

< 2C 2 ■ 5™- 2 ■ fc 2 + b 2 n e} < 2 C 2 • b™- 2 • £. 


(The term b 2 e is obtained by chaining the second and third inequalities of (5.9) and using 
the fact that 5 = e and the uniform bound in (5.6). The last inequality follows because 
b n —> 0 as n —> 00 .) We define e = e 1 / 2 and bound the last term by Csb™~ 2 £ 2 , for some 
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C3 > 0, because b n < 1 from some large n on. The entropy bound in (5.12) as a function of 
e remains the same except for a different constant C > 0 there. 


Now by Theorem 6.8 of Massart (2007) and (5.12), there exist Ci,C 2 > 0 such that 


(5.16) 


sup E 

P£V 


SUP I (n,x,T,k( a i b ) “ E tC n,x,rA a ’ 

a,6:||o||<<5i„,||i6||<52n,TeT,ie5r(£) 


< Ciy/n j yJnA |- log (^j jofe + Ci(b n + s n ) log n 

tilnVIogn 


< c 2 s n y/n log n + C 2 b n log n = O 


7iV 4 /i d / 4 


where the last equality follows by the definitions of b n and s n in (5.15) and by Assumption 
QR2(ii). 

(ii) Define A T)4n (S' i ; x) = A X)Ti/fcii and = {l T (K, 4 n(-]x)) : r G T,x G <S r (£)}, and C k , 2 = 
{A T ,2n(s x)X Tj 3 n('] x ) : t E T , x G <S r (£)}. We write 

'*Pn,x,T,k \jl^n,x,T,k E [f/N,a;,T,fc]} T E • 

The leading term is an empirical process indexed by the functions in C k = £ k ,i • £ k , 2 - 
Approximating the indicator function in l T by upper and lower Lipschitz functions and 
following similar arguments in the proof of (i), we find that 

SUp log N[](£,C k ,L p (P)) < C - C log £ + C log 71, 

Pev 

for some constant C > 0. Note that we can take a constant function C as an envelope of 
C k . Then we follow the proof of Lemma 2 to obtain that 


E 


SUp ||i/U,x,T,fc E ['0n;X,r,ft] 
reT,xe 5 r (e) 


= 0(y/\ogn), uniformly in P £ V. 


By using (5.4) and (]5.5|), we find that 
(5.17) 


E [V’n,*,r,fc] = 0(h r+1 ) = o(A/log n), uniformly in P G V, 

because Vnh d h r+l /\/log n —* 0 by Assumption QR2(ii). 

(iii) Recall the definition of g n ,x,r,k(Si', s, b , a) in the proof of Lemma QRl(i). We write 

E[C£*, T ,fc( a » b)]=n E [g n>x , Ttk (Si, s, b, a)} ds. 

Jo 

Using change of variables, we rewrite 


E [gn,x,T,k (5*1) 5, f), ct)] ds k P {Li k|• (f) n (yXi^ a^b '), 
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where 


<j) n (Xi,a,b ) = 


n (b+a) T c htXi i/VnhA ( ^ ^ _ S^Xf, x)\X t ) 

i a T Ch 1 x,i/'/nh d I F r ,k ( fin,T,k(Xij X~) |Aj) 


du ' Xh t x,i 


By expanding the difference, we have 


4>n(Xi ; a, b ) = 


-(b+a) T c h , X 'i/\/nh d 


udu • fr.k ( ^n,T,fc(-^Q) *£)|Aj) ' “h Rn,x,i(fli ^)> 


where R n ,x,i( a ->b) denotes the remainder term in the expansion. As for the leading integral, 

r(b+a) T c htX ,i/VnhA ^ 

/ = { bTc h,x,i°h, x ,i(b +2a)}. 

J a T Ch, x ,i/^nh d znri 

Hence, for any sequences a n ,b n , we can write E[<^ xrfc (a n , b n )] as 

h) T n h~ d k'E [P {R = k\Xi} f Tik {-S ntTtk (Xi, x)\Xi) c h}Xti cl xi ■ K h> X)i ] (b n + 2 a n ) 
+nkE \P {Li = k\Xi} R n ,x,i{P"m bn)] 

= 2 ^ n M n ,T,k{x)(b n + 2a n ) 4- n/cE [H {Aj = A, - |Aj]- Rn,x,i(flni bn)\ ■ 

We can bound 


nk |E ]P {Li /u|Wj j- Rn,x,iiflni bn 

I r{bn+a n ) T c htXti /Vnh d 


< G\ nkE 


' aL c h,x,i/Lnh d 


ZL dll * K}i,x,i — 


C 2 b n al 
n 1 / 2 h d / 2 ’ 


where Ci > 0 and C 2 > 0 are constants that do not depend on n or P G V. 


Proof of Theorem 1. (i) Let 

(5.18) ZLn,x,r — ■M n , T j^(x^^n,x,T,k) 

r ifn,x,T,k(p') — b 'ipn,x,r,k ~l“ b M n ^ T ,k (^) bj2 , and 

r lpn,x,T,k{fl')b^) — 'lfn,x,T,k(0' “I ~ b') ^Pn,x,r,k(^^) • 

For any a G RJ Ar l, we can write 

(5.19) 'tpn,x,T,k('U"n,x,T'> & ^n,a;,r) ^n,x,T,k (^) VVi,#,T,fc(^n,:r,T) 

{fl ^n,a?,r) J ^n,r,fc(^) (^ ^n,tc,r) /2 

^ C'lHa ^n,a;,r|| 5 

where Ci > 0 is a constant that does not depend on r G T, a: G 5 r (e) or P G P. The 
last inequality uses Assumption QR1 and the fact that K is a nonnegative map that is not 
constant at zero and Lipschitz continuous. 
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Let 

i,i,r = VnJpH(% k {x) - 7 T ,fc(x)), 

where a; G S T {e) and r G T. Since C n,x,T,k(u n , x ,T,b) is convex in b, we have for any 0 < 5 < l 
and for any b G Rj Ar l such that ||6|| = 1, 

(5.20) (^/0Cn,x,T,A; (u n ,x,Ti Ify ^ Cn,x,T,ki^n,x,r j 8b) 

— ^Pn,x,T,k(W“n,x,Ti fob'} A n k(8), 


where 


A n> k(8) — SUp \Cn,x,T,k(^n,x,Ti 8b) ^n,x,T,k(^n,x,Ti 8b)\. 

beRl- 4 ’'l:||6||<l 

Therefore, if \\u n , x , T ~ u n ^ T \\ > 8, we replace b by u^ XT = (u njX}T - u n , x ,i 


u 


n,x,r 


— U 


n,x,T | 


and l by \\u n ,x,T — w n ,x,r|| in (5.18), and use (5.20) to obtain that 
(5.21) 0 ^ Cn,x,r,fc(^n,x,T) ||Kn,x,r ^"n,x,T | |^n,x,r) 

^ Cn,x,T,k(^n,x,Ti 8'8 nxT ) 

^ '*Pn,x,T,k (^n,x,rj 8u nxT ) ^n,fc(<f) 




— A n ,k(5) — Cih 2 — A n< k(5), 


for all 5 < ||fi n)a , )r — ft n , x , T ||, where the first inequality follows because C n,x,T,k(u n , x ,T, ll^n,®,? 


u, 


hXjT \\b) is minimized at b = u^ XT by the definition of local polynomial estimation, the 


second and the third inequality follows by (|5.20|), and the fourth inequality follows from 

2 = 1 . 


(5.19), and the last equality follows because \ \u: 


A 

n.x.r I 


We take large M > 0 and let 
(5.22) 


5 m = Mi/logn and S 2n = • 


If & 2 n < || u n ,x,T ~ u n , x ,T ||, we have 


Cl — An t k(S-2n), 


from (5.21). We let 


In — 1 ^ SUp ||^n,®,r|| — M8\ n 
r£T,x£S T (e) 


Then we write 


(5.23) P\ inf \\u n ,x,r ~ u n . X:T \\ 2 > 8\ n f < P {A nifc (<5 2n )l n > 5% n } + E [1 - l n ]. 
reT,xe5 T (e) 
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Now, we show that the first probability vanishes as n — » oo. For each b e using the 

definition of i> n , x ,r,k{^n,x,T, b) = ^ n ,x,T,k{u n ,x,T + b) - '0n I x,T,fc(wn,x J r), we write 

;,T,k(U"n,x,Ti b'j V , n,x,r,fc(lln,a;,r T &) ^n,i,r,i;(^n,i,r) 

b ifi n,x,T,k T (u n,x,T “I - -^-n,x,T (^n,x,r T &)/2 ^n,x tT -^ n , X ,T'^n,x,T 

b 'l/jn,x,r,k "F b M n ^ XT b/2 T b Mn,x,T^n,x,T 
= b T M n , XiT b/ 2. 

Therefore, 

C n,x,T,k{u nt x,Tib) — r ipn,x,T,k(‘U ntXjT ,b) = Cn,x,T,k{^' n , x ,' r t b) — E [Cn,x,r,fc (^n,x,r j ^)] 

+E [C^*,T,fc(“n,*,r > &)] - b T M n>XtT b/ 2 + 6 T ^ n , Xi T)fc 

= Cn,x,T,k('b’n,x,Ti b) — E [Cn,x,T,fc(^h»,x,Tj ^)] 

+E [Ct*,T,fc(“n,*,r > 6 )] - b T M n ^ r (b + 2u U}XtT )/2. 


By Lemma QRl(i), 


sup 


sup 


= O 


reT,xe*S T (e) 6eRl Ar l:||6||<52„ 

5 2nVlogn 


|Cn,x,r,fc(^rc,x,T> ^ [Cn,x,r,fc( M n,x,r j ^)] 


n !/ 4 / z d/ 4 


by the definition in (5.22). And by Lemma QRl(iii), 


sup SUp |E [Cn,x,r,fc iPn,x,ri ^)] ^ ^^n,x,r{b T 2'U rli3 ; )T )/2| 

r£T,xeS T {e) feeRl A ’-l:||6||<5 2rl 
/ 5 2n log n \ 

V 71W/2 J ’ 


by the definition in (5.22) and Assumption QR2(ii). Thus we conclude that 
(5-24) |A„, t («| =oJ S ^°^ 


where the last Op term is uniform over P e V. We deduce from (5.24) that 


sup P <( A nifc (6 2n )l sup \\Un,x,r\ \ < $ln } > ^2n } 0 as 71 ->■ OO 

P&V I I tgT,x£S t {s) 

and as M "f oo. The proof is completed because 


SUp P \ sup ||Un,x,rl 

P&V I t£T,x£S t (e) 


> 5In r ~^ O’ 
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as n — y oo and as M \ oo by Lemma QRl(ii). Thus, we conclude from (5.23) that 

( \f\ogn \ 


^n,x,T "^n,a;,r|| Op 


, V- uniformly. 


yn 1 / 4 /^/ 4 / 

Now the desired result of Theorem 1 follows from the fact that 

M n,x,T^‘i’n,x,T,k = 0(h + ) = O f 1/4 ^ 4 j 


which follows by Assumption QR1, (5.17), and Assumption QR2. 


□ 


As mentioned in the main text, the convergence rate in the asymptotic linear representa¬ 
tion is slightly faster than the rate in Theorem 2 of Guerre and Sabbah (2012). To see this 
difference closely, Guerre and Sabbah (2012) on page 118 wrote, for fixed numbers x and y, 


fX+y 


l T (£i + X + y) - l r (£i + x) - y ■ l T (Si + x)= (l{e* < t} - 1 {e* < 0 }dt, 


where Ei is a certain random variable with density function, say, / which satisfies 11/| |oo < oo. 
From this, Guerre and Sabbah (2012) proceeded as follows: 


E 


l T (£i + x + y) - l T (ei + x) - y ■ l r (£i + x) 


rx+y 


< 2\y\ / E [(1 {£i < t} — 1 {Ei < 0}) 2 ] dt 

J X 

rx+y 


< 2 


\t\dt < 2\y\ (|x| + 


On the other hand, this paper considers Knight (1998)’s inequality and proceeds as follows: 


E 


l T {£i + x + y) - l T (£i + x) - y ■ l T (si + x) 


< \y\ 2 I E \l{£i + x < yt} - 1+i + x < 0}| dt 

Jo 

< \y\ 2 P{-\y\<ei + x<\y\}<2\\f\\ oc \y\ 3 . 


Note that when \y\ is decreasing to zero faster than |x|, the latter bound is an improved 
one. The tighter L 2 bound gives a sharper bound when we apply the maximal inequality of 
Massart (2007) which yields a slightly faster error rate. (Compare Proposition A.l of Guerre 
and Sabbah (2012) with Lemma QR1 where 5\ n and £ 2 ™ in Lemma QR1 correspond to tp 
and t t in Proposition A.l respectively.) 
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Proof of Corollary 1. First, we write 


^■n,r,k( X )' l l J n,x,r,k ~ M n ,r,k( X ) ^Pn,x,T,k + ^ ; (Pn,i( x , 


t) -E[p nti (x,r)}) , 


where 


iK, 


h,x,i- 


Pn,i(%i I") 1 {Ci l> } ^ ^ (^x,r,Zfc,i) It ipT,lk,i)^ Ch,x,i 

1=1 

It suffices for Corollary 1 to show that 

1 " / InP’ 1 / 2 u \ 

^T,k0*0^= ^ “ E bn,i(®,T)]) = Op . 

Using the definition in (5.1), writing S XyT: k,i = S Tj k(Xi]x), and using Knight’s identity, we 
write 

Li 

1 {L/i A } ^ ^ f/ T \ £r,lk,i T *^) ) ( pT,lk,i)J Ch,x,iXfi,x,i 


1=1 


i {Lj a } s XtTj k,j 'y ] 


Li / „l / I I _5 „ 

I ^ ] >Z/ T,lk,i — u x,r,k,i’ x> 

1 < 0 } 


] Ch,x,i-Kfi,: 


1=1 


Following the same arguments in the proof of Lemma QRl(i), we deduce that 

/ log 1 / 2 n \ 

\ rpl 4 h d / 4 J 


M n,lA x ^~rXfYl (aw( x > t ) - E Ki( x - r )D = Oi 

\nn i=1 




uniformly over r G T, x G «S r (e:) and over P EV. 


□ 


For z = (x,t) £ Z and a, b e R, we define 

1 LI A 


c,x, T y«.(-) = E 1 (n = nE 

We also define 


i= 1 


Z=1 


x,r,lk,i 

It (A 


a + b V C h,x,i /\/^ 

X,T,Ifc,i - iTc l A ,/VS 


K\ 


h,x,i' 


Cn*x,r,k(^b) = C,x,r,k( a , b ) ~ ^ ^n,x,T,k- 

The following lemma is the bootstrap analogue of Lemma QR1. 


Lemma QR2. Suppose that Assumptions QR1-QR2 hold. Let {<5i n }^ =1 and { 82 n}ff= i be 
positive sequences such that 8 \ n = M A/log n for some M > 0 and 5 2n < 8 \ n from some large 
n on. Then for each k e Nl, the following holds uniformly over P e V: 
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(i) 


(a) 


E* 


a,b: 


sup 

MI<‘5ln,||&||<<Ln 


sup 

r&T,x&S 1 


(e) 


Ip A* | 
IS n,x,r,k ' 


a, b ) 




— Op 


4(V 1q gn \ 

n l / A h d / A ) 


sup sup 

0,6:||o||<5ln,||6||<62n t£T,x£St(£) 


E’ICUM)] 


— Op 


^Vjog ~n\ 

n 1 l A h d l A J 


b T M n ^ k (x)(b + 2 a) 
2 


Note that the convergence rate in Lemma QRl(ii) is slower than that in Lemma QRl(iii). 


Proof of Lemma QR2. (i) Similarly as in the proof of Lemma QRl(i), we rewrite £^* Tfc (a, 6 ) — 
E lCt;,r,fc(M)] as 

n 

Y {Gn, x ,r,k{S*-a, b) — E [G mk {S*- a, 6)]} , 

2 = 1 

where S* = X* T ) T . Let n = (x,r, s,a,b) and n„ = 5(e) x T x [ 0 , 1 ] x [— 8 \ n , 5i n ] r+1 x 
[— 82 m ^ 2 n] r+1 , where 5(e) = {(x,r) e X x T : x G 5 r (e)}. Using Proposition 2.5 of Gine 
(1997), 


E 


E* 


SU P sup \( n * :T , k ( a , b) - E* [C n> * T)fc (a, 6 )] I 

a,6:||o||<5i„,||6||<<52„ TeT,xeS T (e) 


< C E 


Ejv, ( sup 

iren n 


^ ' (Li j 1 ) \ 9n,x,r,k(Sii Sj 6 , (?) ^ ^ 9n,x,r,k(Sii 5 , 6 , (?) 

I ^ 


2=1 


2=1 


where {iVj }” =1 are i.i.d. Poisson random variables with mean 1 independent of {(P)" 1 ", Xj ) T }^ X , 
E^ denotes expectation only with respect to the distribution of {iV,}” =1 , and g n , x ,T,k( S s , b, a ) 
is as defined in the proof of Lemma QRl(i). Here the constant C > 0 does not depend on 
P G V. We can bound the above by 


CE 


sup 

7ren n 


^ ^ (Xj 1) (9n,x,T,k(Sii 5, 6, (?) E [(Jn,x,T,k (5j, S, b , (?)]) 


2=1 


+CE 


- a 


i=l 


x E ( sup 

Ten n 


n 


^ ^ 5 f n,rr,r,A:(*S f 2 5 ® ^)] 


2=1 
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The leading expectation is bounded by 0(52n\/logn/ {n l ^h d ^ A )) similarly as in the proof of 
Lemma QRl(i). And the product of the two expectations in the second term is bounded by 


0(Vn) x —E sup 


n 


, TTGlIn 


^ ^ {Qn,x,T,k[Si) 5 , 6 , Cl ) E [c/n,x,T,k(^i') ^0] } 


= O [52 / n 2 ^logn/(7i 3/4 h d/ 


i=1 
4' 


where the constant C > 0 does not depend on P e V, and the last equality follows similarly 
as in the proof of Lemma QRl(i). 

(ii) Note that 

(5.25) E*[C„ A ;. Til (a, 6)] = E*[C„ A ; Tii (a, 6)] - E[C£, T ,*(a, b )] + E[^ IiTit (a, 6)]. 

The difference between the first two terms on the right hand side is 

n ( 

P y nWh*/* 

uniformly in P £ V, as we have seen in (i). We apply Lemma QRl(iii) to the last expectation 
in (5.25) to obtain the desired result. □ 


Proof of Theorem 2. The proof is completed by using Lemma QR2 precisely in the same 
way as the proof of Theorem 1 used Lemma QR1. While the convergence rate in Lemma 
QR2(ii) is slower than that in Lemma QRl(iii), we obtain the same convergence rate in the 
bootstrap version of (5.24). Details are omitted. □ 


Lemma MIQ 1. (i) Suppose that the conditions of Theorem 3(i) hold. Then Asumptions 
A1-A3, A5-A6, and Bl-Bf in LSW hold with the following definitions: J = 1, r n = y/nh 3 , 

v U)T {x ) = ejy T (x), and 

Z) = -l T ( Yi — ~fj(x) ■ H ■ c (z)) ej M~l(x)c (z) K (z). 


(ii) Suppose that the conditions of Theorem 3(H) hold. Then Asumptions A1-A3, A5-A6, 
and Bl-Bf in LSW hold with the following definitions: J — 1, r n = y/nh 3 , 

v n , T ( x ) = eJ{j Tl (x) - j T2 (x)}, and 

fin,x,T j z) — 0^n,x,Ti (^ij z) (%n,x,T2 zfi 

where the set T in LSW is replaced by T x T here, and 

a n ,xA Y i’ z ) = ~~ l T (Yi ~ lA x ) ■ H ■ c (z)) ejM~((x)c (z) K (z). 
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Proof of Lemma MIQ. (i) First, Assumption Al in LSW follows from Theorem 1, with the 
error rate in the asymptotic linear representation fulfills the rate Op(h 1//2 ) by the condition: 
r > 3/2. Assumption A2 follows because /3 ntXlT (Yi, z) has a multiplicative component of 
K(z) having a compact support. As for Assumption A3, we can use Lemma 2 in LSW 
in combinations of Lipschitz continuity of /.(• |-) and j.(-) to verify the assumption. As we 
take o~r,j(x) = &*j(x) = 1, Assumptions A5 and B3 are trivially satisfied with the choice 
of a U)T) j{x) = 1. Assumption A6(i) is satisfied because f3 n , x , T ,j is bounded. Assumptions As¬ 
sumption B1 follows by Lemma QR2, and Assumption B2 by Lemma 2 in LSW. Assumption 
B4 follows by Assumption MON2(ii). (ii) The proof is similar and details are omitted. □ 

Proof of Theorem 3. The results follow from Theorem 1 from LSW combined with Lemma 
MIQ1. Details are omitted. □ 
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Table 1 . Results of Monte Carlo experiments 



Null model 

Alternative Model 1 

Bandwidth 

Nominal level 

Nominal level 

(h) 

0.10 

0.05 

0.01 

0.10 

0.05 

0.01 

0.9 

0.111 

0.057 

0.020 

0.995 

0.975 

0.780 

1.0 

0.100 

0.048 

0.007 

0.980 

0.920 

0.660 

1.1 

0.077 

0.036 

0.005 

0.905 

0.755 

0.375 


Alternative Model 2 

Alternative Model 3 

Bandwidth 

Nominal level 

Nominal level 

(h) 

0.10 

0.05 

0.01 

0.10 

0.05 

0.01 

0.9 

0.985 

0.965 

0.800 

0.990 

0.970 

0.660 

1.0 

1.000 

0.995 

0.935 

1.000 

0.990 

0.820 

1.1 

1.000 

1.000 

0.985 

1.000 

0.990 

0.835 


Alternative Model 4 

Alternative Model 5 

Bandwidth 

Nominal level 

Nominal level 

(h) 

0.10 

0.05 

0.01 

0.10 

0.05 

0.01 

0.9 

1.000 

0.960 

0.540 

0.995 

0.980 

0.845 

1.0 

0.885 

0.645 

0.175 

0.995 

0.995 

0.985 

1.1 

0.310 

0.120 

0.010 

0.995 

0.990 

0.935 
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Figure 1. True Function and Simulated Data 

Simulated Data and True Function 



Simulated Data and True Function 



Simulated Data and True Function 



Note: Each figure shows the true function and simulated data {(Yi,AT) : 1 = 
1,,n = 100} being generated from \\ = nij(Xi) + U tl where X ~ Unif[0,1] and 
Ui ~ X 4 x N(0,0.1 2 ), and ?n 0 (x ) = 0, m 1 (x) = x(l — x), and m 2 {x) = —0.1a:, 
respectively. 
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Figure 2. True Function and Simulated Data 

Simulated Data and True Function 



Simulated Data and True Function 



Simulated Data and True Function 



Note: Each figure shows the true function and simulated data {(Y),AT) : 1 = 
1 = 100} being generated from 1} = rrij(Xi) + Ui, where X ~ Unif[0,1] 

and Ui ~ X 4 x N(0,0.1 2 ), and m 3 (x) = —0.1 exp(—50(a; — 0.5) 2 ), m 4 (x) = 
x + 0.6exp(—10x 2 ), and m 5 {x) = [10((a; — 0.5) 3 ) — 2exp(—10((a; — 0.5) 2 ))]l(a: < 
0.5) + [0.1 (a; — 0.5) — 2exp(—10((a: — 0.5) 2 ))]l(a: > 0.5), respectively. 


















